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ABSTRACT 

Spectra of high redshift QSOs show deep Gunn-Peterson absorptions on the 
blue sides of the Lya emissions hnes. They can be decomposed into components 
called Lya leaks, defined to be emissive regions in complementary to otherwise 
zero-fiuxed absorption gaps. Just hke Lya absorption forests at low redshifts, 
Lya leaks are both easy to find in observations and containing rich sets of sta- 
tistical properties that can be used to study the early evolution of the IGM. 
Among all properties of a leak profile, we investigate its equivalent width in this 
paper, since it is weakly affected by instrumental resolution and noise. Using 10 
Keck QSO spectra at 2 ~ 6, we have measured the number density distribution 
function n{W, z), defined to be the number of leaks per equivalent width W and 
per redshift z, in the redshift range 5.4 — 6.0. These new observational statistics, 
in both the differential and cumulative forms, fit well to hydro numerical simula- 
tions of uniform ionizing background in the ACDM cosmology. In this model, Ly 
a leaks are mainly due to low density voids. It supports the early studies that 
the IGM at z ~ 6 would still be in a highly ionized state with neutral hydrogen 
fraction ~ lO""^. Measurements of n{W, z) at z > 6 would be effective to probe 
the reionization of the IGM. 

Subject headings: cosmology: theory - intergalactic medium - large-scale struc- 
ture of the universe 



1. Introduction 

The absorption spectra of QSOs at low redshift show Lya forests, which have played 
an important role to understand the physical status of diffused cosmic baryon gas and the 
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ionizing background. At redshift z > 5, however, they are no longer to show forests features, 
but consist of complete absorption troughs separated by spikes of the transmitted flux (e.g. 
Becker et al. 2001; Fan et al. 2006). That is, although the cosmic hydrogen gas at z > 5, in 
average, is opaque for Lya photons, there are many tiny regions, which are Gunn-Peterson 
transparent and lead to Lya photon leaking. 

The nature of the leaking is crucial to understand the physics of reionization. According 
to commonly accepted scenario of reionization, at early stage, only isolated patches around 
ionizing sources are highly ionized. The subsequent growing and overlapping of the ionizing 
patches lead to a uniform ionizing background and the end of reionization (e.g., Ciardi et al. 
2003; Sokasian et al. 2003; Gnedin 2004; McUcma et al. 2006) The ionization fraction of the 
IGM and the ionizing radiation underwent an evolution from highly non-uniform patches 
to a quasi-homogeneous field. Before the patch-to-uniform transition, only ionized patches 
would be transparent to Lya photons. After the transition, the low density voids will also 
be Gunn-Peterson transparent. Therefore, the origin of Lya leaks will constrain the epoch 
of the patch-to-uniform transition. In this Letter, we study the origin of Lya leaks in the 
observed spectra of QSOs at ^ ~ 6. 

Several statistics have been introduced to describe the transmitted flux of Lya absorp- 
tion at high redshifts, including the probability distribution function (PDF) of the flux (Fan 
et al. 2002; Becker et al. 2007), the distribution of the size of dark gaps (Songaila & Cowie 
2002; Fan et al. 2006), and the largest peak width distribution (Gallcrani et al. 2007). We 
will focus on the profile of the leaking features in the transmitted fiux. We fit these statistical 
features with samples of hydrodynamic simulation with a uniform ionizing background, and 
analyze the possibility of explaining the leaks by ionized patches embedded in neural IGM 
background. 

2. Samples 

1. Observational spectra of high redshift QSOs. The observational spectra used here are 
10 of the 12 Keck spectra of QSOs at redshift z > 5.8 compiled in Fan et al. (2006). We 
excluded 2 BAL QSOs. The data have a uniform resolution of ~ 4000 and are re-binned 
to a resolution it! = 2600. To avoid the mixing of Ly/3 absorption and the effect of QSO's 
H II region, only the rest frame wavelength between 1050 to 1170 A are used. To study 
the evolution of Lya leaks, we divide the spectra into two redshift bins, 5.4 < z < 5.7 
and 5.7 < z < 6.0. The observed flux, fobs, is normalized with a power-law continuum 
fcon i'^^'^. The noise level of transmitted flux F = fobs/fcon is about 0.018 ± 0.012 and 
0.014 ± 0.008 for above two redshift bins. For more details, we refer to Fan et al. (2006). 
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2. Simulation samples. We simulate Lya absorption spectra with a hybrid gas/dark 
matter code based on WENO scheme (Feng et al. 2004). The simulation is performed in a 
comoving box of 100/i~^Mpc with a 512^ mesh. We use the concordance ACDM cosmology 
model with parameters to be f)m=0.27, f2b=0.044, fl\— 0.73, h=0.71, (78=0.84, and spectral 
index n = 1. 

It has been shown that the observed dramatic decrease and abnormally large scatter 
of Gunn-Peterson optical depth at 2; ~ 6 (Fan et al. 2006) can be well fitted by models 
of a uniform ionizing background (Lidz et al. 2006; Liu et al. 2006). That is, the large 
scattering of Gunn-Pctcrson optical depth may still be mainly due to the inhomogeneity of 
the IGM density field. Therefore, we investigate whether such a uniform ionizing background 
can explain the leaks. In this context, the uniform photoionization rate is adjusted to yield 
the same mean optical depth as observational data at the redshifts considered. A thermal 
energy of T = 3 x lO^K is added at 2; = 10, and only adiabatic coohng and shock heating 
are followed. With this method, the photoionization rate are found to be equal to 0.63 
and 0.33 in units of lO^^^s^^ at redshift z =5.55 and 5.85, and the corresponding neutral 
hydrogen fraction are 3 x 10^^ and 7 x 10~^. The simulated spectra were smoothed with a 
Gaussian window of a FWHM corresponding to i? ~ 4000, and re-binned to pixels of the 
size of = 2600. We add Gaussian noises to the re-binned fluxes with variances equal to 
the observational noise level. 



3. Statistics of Lya leaks 

1. Identification of Lya leaks. 

Lya leaks are identified as contiguous pixels where the fiuxes have a maximum larger 
than 2 or 3 (7 of the local noise level. The boundaries of a leak are defined as positions where 
the fiuxes are smaller than a threshold Fj/j, or are the minimum between the neighboring 
leaks. That is, if there are two local maximums above 2 or 3 cr of the noise level, each one is 
identified as a leak. We take the threshold Ffh = 0.02 in this letter. Note the identification 
of a leak depends mostly on the condition of the maximum fiux (see discussion on Figure 

1 below). With this method we decompose the transmitted fluxes between Gunn-Peterson 
troughs into Lya leaks of different proflles. The Lya leaks contain information different from 
the size of dark gaps and largest peak width, both of which measure only length scales. 

To test the identiflcation condition, we count the number of fake leaks due to noise in 
100 simulation samples. The percentage of fake leaks are 2.3%(13%) and 0.4%(1.4%) for the 

2 a and 3a identiflcation in redshifts range 5.4 — 5.7(5.7 — 6.0). Similarly, we also count the 
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number of missed leaks due to noise. The percentage of such missing leaks are 5% (8%) and 
15%(22%) for the 2(3) a in redshifts range 5.4 — 5.7(5.7 — 6.0), respectively. The fluxes of 
missing leaks generally are around Fth- Therefore, the leak identification with Fth = 0.02 
is statistically reliable. In the 10 Keck spectra, there are totally 173 and 147 leaks in the 
redshift range 5.4 < z < 5.7, and 39 and 32 leaks in 5.7 < z < 6.0 for the 2 and 3 a 
identification, respectively. The fluxes of smallest leaks are a little higher than F = 0.02, 
while big leaks can have F ~ 0.3. 

2. Equivalent width functions. 

Similar to emission and absorption lines, we can measure the profile of Lya leak with 
equivalent width, which is defined as the area under its fiux profile, W = f Fd\, where 
the integral is over the range between the boundaries. For our observed samples, W lasts 
in the range from 0.06 A to about 5 A. In general, the equivalent width W measures the 
strength of the leaking, or the Gunn- Peterson optical depth within the leaking regions. The 
statistical description we used is the equivalent width function n{W, z), which is the number 
of leaks of W at redshift z per unit W per unit z. The equivalent width function reflects the 
distribution of the strength of leaking. 

We count the observed W into 15 bins with logarithm size Aln = (1/15) ln(10/0.01). 
The results are shown in Figure 1, which is for leaks at redshifts 5.4 — 5.7 (top) and 5.7 — 6.0 
(bottom), and the 2 a (left) and 3 a (right) identiflcation. The error bar is of Poisson 
fluctuation. The functions n{W, z) are weakly dependent on the identiflcation. Although 
the total numbers of leaks of 2a and 3(7 samples are different, the shape of n{W, z) for both 
samples are about the same. As expected, for large leaks oiW > 0.5 A, the functions n{W, z) 
are independent of the 2 or 3(T condition, while for small leaks oiW< 0.5 A, the function 
niW^i z) of 3cr is a little lower than that of 2cr. 

Figure 1 also shows the results given by 100 simulation samples. The solid curves are 
the mean of the samples, and the dot lines give the jackknife error estimator, which is to 
divide the 100 samples into 10 subsamples and compute the variance over the 10 subsamples. 
We see that the distributions of n{W^ z) of simulation samples generally are good to fit the 
observed samples. To test the effect of noise, we also calculated the function n{W, z) of 
simulation samples without noise addition, and the results are shown in Figure 1 as the 
dashed lines. Without noise addition, the leaks are identified as local maximums above 
Ft^=0.02. Figure 1 shows that the noise has no effect on big leaks {W > 0.5 A). While for 
small leaks (W < 0.5 A), samples without noise addition give higher number of leaks than 
samples with noise addition. This is because the identification condition of 2 and 3 a is more 
rigorous than the condition of Fth=0.02. Therefore, the effect of noise on n{W, z) does not 
change the consistence between observed and modeled n{W,z). The measurement of W is 
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insensitive to the noise, as it is on the area of profile. 

We see from Figure 1 that a few data points at small W show fluctuation around 
simulation result. It is probably caused by the binning. To solve this problem, we calculate 
the cumulative equivalent width function defined as n{> W,z) — n{W, z)dW , which is 
less dependent on the binning. Since the distributions of leaks of 2 and 3 a identification are 
similar, only 2 a identification condition is applied. The results are presented in Figure 2. 
The solid curves are the mean of simulated samples, and the dot lines give the jackknife error 
estimator as in Figure 1. It shows clearly that the cumulative width functions of observed 
leaks are smooth, and gives a better fitting with simulation samples. 

Figure 3 presents n{> W,z) vs. z for leaks of IF = 0.4,1, and 1.6A . The redshift- 
evolution of leaks with larger W is more significant than smaller W leaks. This is natural 
in low density voids scenario. The larger voids have lower probability, or are the events 
given by the tail of the PDF of voids. The PDF tail underwent a strong evolution at high 
redshifts. At redshift z > 6, there are only very few leaks identified from observation data, 
and therefore, we will not extend the analysis to z > 6. 

Since all leaks in simulated spectra arc due to low density voids, the results show that the 
distribution of observed leaks are consistent with low density voids assuming the uniform 
ionizing background. It is interesting to point out that the tail of the cumulative width 
function shown in Figure 2 is close to Gaussian distribution with respect to logarithm W. 
Therefore, n(> W, z) approximately has a lognormal tail of W. 

3. Ionized patches. 

We now estimate the Lya leaking due to the ionized patches around ionizing sources. 
Considering a simple model, ionizing sources embed in a fully neutral IGM at high redshift. 
The scale of ionizing patches can be estimated with radius R = — exp(— t/trec)]^^^, 
where Rs is the Stromgren sphere radius, tree and t are, respectively, the recombination time 
and the active age of the ionizing source. It has been shown that, due to the retardation 
effect of photon propagation, the scale R actually is an upper limit to the ionized volume 
(Shapiro et al. 2006; Qiu et al. 2007a). The retardation effect is more apparent for clustered 
sources (Qiu et al. 2007b). Moreover, it is also shown that the fraction of H I within ionized 
sphere generally is larger than 10~^ unless the intensity of sources > 10^^ sec~^ (Qiu et 
al. 2007a). 

It has been shown that the damping wing of the neutral IGM absorption make ionized 
patches opaque to Lya; photons if the size is too small (Miralda-Escudc 1998). This effect 
is more significant if a small fraction of H I is remained in patches. For instance, an ionized 
patch with neutral fraction of 5 x 10~^ around a galaxy &t z = Q can yield a fiux F = 0.02 
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only if the comoving radius R > 3.5 /i~^Mpc, or TV > 9 x 10^^ sec~^, which requires a 
luminosity L > 1.6 x lO^^L© if assuming a spectra of L^, oc z/~^. Here we also assume all 
the ionizing radiation of a galaxy is capable to contribute to the ionizing sphere, and the 
luminosity L > 1.6 x lO^^L© gives a lower limit to the required luminosity to produce a leak 
with F = 0.02. 

With these results, one can estimate the number of leaks with F > 0.02 due to galaxies 
by using the luminosity function of galaxies at 2; = 6 (Bouwens et al. 2006). The probability 
that a line intercepts patches at an comoving impact parameter r = 1.5 h^^Mpc (since the 
cross radius should be larger than 3.5h~^Mpc, we should use a smaller impact radius) for 
galaxies with luminosity > 1.6 x 10^^ Lq is (e.g., Peebles 1993) 

dN _ 7rrV(> 1.6 x 10^^Lq)c 

dz H{z) ^ ' ^' 

here 0(> 1.6 x 10^^ Lq) is the comoving number density of galaxies with luminosity > 1.6 x 
1O^°L0. On the other hand. Figures 1 and 2 show that the number density of leaks with 
F > 0.02 at 5.7 < 2; < 6 is ~ 35. Therefore, if the IGM 2; = 6 is mostly neutral, and the only 
ionized regions are the patches around galaxies, the leaks of F > 0.02 given by the ionized 
patches of galaxies would be no more than 20% of the observed result. 



4. Discussions and conclusions 

The transmitted fluxes between Gunn-Peterson troughs of high redshift QSO's absorp- 
tion spectra are not only one peak of the flux, but contain rich structures, which can be 
decomposed into Lya leaks. The Lya leaks have profiles similar to emission lines, and can 
be measured by equivalent width W. The equivalent width functions, n{W, z) are effective 

statistical measurement of the process of reionization. We show that the equivalent width 
functions of the observed spectra at redshifts 5.4 < z < 6.0 can be well fitted by hydro 
simulation of the ACDM cosmology assuming the ionizing background to be uniform. In 
this model, all the Lya leaks are leaking through low density voids. 

The mean transmitted flux at z is given by F cx / n{W, z)WdW. Therefore, by adjusting 
the photoionization rate to match the observed Gunn-Peterson optical depth, the mean of 
W for simulation samples should be the same with observation. Thus, Figure 1 actually is 
to show that once we adjusted the mean flux to be the same as observation, the simulation 
yields the same distribution of the observed W. In other word, the scattering of W is caused 
by the fluctuations of mass density field of HI. Therefore, a small inhomogeneity of the 
ionizing background would be allowed. That is, the distribution of W would still be able to 
be fitted with a fluctuated ionizing background if its variance is much less than that of HI. 
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In addition to the distribution of W, the evolution of W is also helpful when differen- 
tiating models. For example, in voids scenario, the evolution of W reflects the evolution of 
low density voids; while for ionized patches, it reflects the evolution of the UV luminosity 
function of ionizing sources. 

We show that the ionized patches of galaxies embedded in a fully neutral IGM at redshift 
2; ~ 6 are not enough to produce the observed leaks. We also show that leaks can only be 
produced by patches around strong ionizing UV photon sources, but not weak sources. 
Especially, big leaks {W > 0.5 A, or F > 0.1) have to come from very strong sources. 
Therefore, at higher redshift, Lja leaks only probe strong ionizing sources. Thus, from the 
existence of many big leaks at 2; < 6 we can conclude that the patch-to-uniform transition 
of the ionizing background would occur at 2; > 6, and most of the IGM at 2; ~ 6 are still 
in a highly ionized state of neutral fraction /hi — 10~^. This result is consistent with the 
analysis of the transmitted flux PDF (Becker et al. 2007), the QSO proximity zones (Lidz 
et al. 2007) and the luminosity function of Lya emitting galaxies (Dijkstra et al. 2007). 

It should be pointed out that the resolution of the observed data is low, ~ 3 A, which 
corresponds to a comoving size ~ 0.7 h~^Mpc. In contrast, most of the simulated leaks 
possess a intrinsic width < SA. Thus the low resolution data provide only a test of smoothed 
leaking features. One cannot see whether the smoothed features is due to individual or 
clustered leaks. Higher resolution spectra would be able to test not only the width functions, 
but also the spatial correlations of the leaks. They can also provide other measurements of 
Lya leaks, like the FWHM, which would be effective for a confrontation between real data 
and models. 

The statistics of Lya leaks at redshifts < 6 actually is the statistics of voids formed in 
the early universe. The equivalent width functions, n(W, z) of Lya leaks are similar to the 
mass function of galactic clusters. Thus, one can expect that the width functions of voids 
are sensitively dependent on cosmological parameters, and play a similar role as the mass 
function of clusters. For instance, the formation of large voids is found to be sensitively 
dependent on the mass parameter (Miranda & de Araujo 2001). With data of leaks, 
we may set constraint on cosmological parameters at high redshifts. This approach will be 
reported separately. 
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Fig. 1. — Equivalent width function n{W, z) for leaks of 2cr (left) and 3 a (right) identification 
at redshift ranges z — 5.4 — 5.7 (top) and 5.7 — 6 (bottom). The data points are from 10 Keck 
QSO spectra. The error bars are from Poisson fiuctuation. The solid lines are calculated 
with 100 simulated spectra; dot lines are the range of variance over the 10 subsets, each of 
which contains 10 spectra; dash lines are for samples without noise. The noise has little 
effects on equivalent width W. 
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Fig. 2. — Cumulative width function n{> W, z) for leaks at redshift ranges z = 5.4 — 5.7 
(top) and 5.7 — 6 (bottom). The data points are from 10 Keck QSO spectra. The solid lines 
are from 100 simulated spectra; dot lines are the range of variance over the 10 subsets, each 
of which contains 10 spectra. 




Fig. 3. — The redshift evolution of cumulative width function n{> W, z) of leaks of 1^ = 
0.4, 1, and 1.6A (top). The data are taken from Figure 2. The larger leaks evolve faster. 



